GMM and ARVM cooperation and competition for text-independent speaker recognition on telephone speech

نویسندگان

  • Jean-Luc Le Floch
  • Claude Montacié
  • Marie-José Caraty
چکیده

We develop a cooperation and a competition of two different natures modelizations. The first one, the GMM [1], is a modelization of the parametrisation distribution of the speaker speech. The second, the ARVM [2, 3], is a modelization of the speaker speech spectral evolution. To allow cooperation and competition between different modelizations we use a classical measure normalization. We investigate the cooperation/competition of the GMM and ARVM on two levels : global and analytic. In order to improve the performances, we used results of previous study [4] and repeat the experiments on selected phonetic segments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch

We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identi cation (clo...

متن کامل

Comparison of text - independent speaker recognition methodson telephone speech with

We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identiication (clo...

متن کامل

Text-constrained speaker recognition on a text-independent task

We present an approach to speaker recognition in the textindependent domain of conversational telephone speech using a text-constrained system designed to employ select highfrequency keywords in the speech stream. The system uses speaker word models generated via Hidden Markov Models (HMMs) — a departure from the traditional Gaussian Mixture Model (GMM) approach dominant in text-independent wor...

متن کامل

An Evaluation of DTW, AA and ARVM for Fixed-Text Speaker Identification

Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected ove...

متن کامل

Telephone based speaker recognition using multiple binary classifier and Gaussian mixture models

The present study evaluates MBCM and GMM solutions for both ASV and ASI problems involving text-independent telephone speech from the King speech database. The MBCM's accuracy is enhanced by selectively removing those classi ers within the model which perform worst (pruning). An unpruned MBCM outperforms a GMM for ASV and speakers taken from within the same dialectic region (San Diego, CA). Onc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996